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1 .   INTRODUCTION 

Random  coefficient  models  and  ordinary  linear  models  with  non-Gaussian 

marginal  distributions  have  been  developed  for  a  variety  of  interesting 

situations  in  time  series  analysis  to  offer  viable  alternatives  to  the 

standard  Gaussian  assumptions  (see  Lewis,  1985,  for  a  survey).  Gastwirth  and 

Wolff  (1965)  in  an  unpublished  manuscript  developed  a  stationary  linear  first 

order  autoregressive  process  (i.e.,  one  which  satisfies  the  equation 

X  =pX  .  +  e   ,  n>l;  -Kp<l)  with  a  Laplace  marginal  distribution  for  the  X  's 
n   n-1    n  n 

(called  LAR(l)).   Independently  Gaver  and  Lewis  (1980)  developed  a  linear 
AR(1)  process  satisfying  the  same  process  but  with  a  Gamma  marginal 
distribution,  called  GAR(l).   Subsequently  both  of  these  processes  have  been 
shown  to  be  special  cases  of  more  general  discrete  random  coefficient 
autoregressive  models  (Dewald  and  Lewis,  1985  and  Lawrance  and  Lewis,  1981  and 
1985).   Other  time  series  models  using  continuous  random  coefficients  with  a 
specified  marginal  distribution  are,  for  Gamma  distributions,  due  to  Lewis 
(1981),  Hugus  (1982),  and  Lewis,  McKenzie  and  Hugus  (1989),  and,  with  Beta 
distributions,  due  to  McKenzie  (1985).   A  first-order  autoregressive  process 
with  the  symmetric  thick  tailed  hyperbolic  secant  distribution  has  been 
discussed  by  Rao  and  Johnson  (1988).   Nicholls  and  Quinn  (1982)  discuss 
general  random  coefficient  autoregressive  processes  without  reference  to  a 
particular  marginal  distribution. 

The  Laplace  LAR(l)  model,  and  its  generalizations  to  higher  order 
correlation  structures,  was  put  forward  by  Dewald  and  Lewis.  (1985)  as  a  model 
where  two-sided  symmetrical  random  variables  had  larger  kurtosis  and  longer 
tails  than  could  be  expected  from  Gaussian  time  series.   A  particular  example 
is  that  of  position  errors  in  a  large  navigation  system  (Hsu,  1979)  which  were 


found  to  have  Laplace  distributions.   Again  the  N-S  or  E-W  components  of  wind 
velocity  data  are  often  symmetric  and  long- tailed,  especially  in  the  tropics. 
For  a  summary  of  these  applications  and  a  summary  of  methods  of  generating 
non-normal  time  series,  see  Lewis  (1985)  and  Dewald  and  Lewis  (1985). 

Another  application  of  the  Laplace  models  arises  when  positive-valued 
Gamma  time  series  are  differenced  to  remove  trends.  The  resulting  marginal 
random  variables  are  two-sided  and  result  in  the  5-Laplace  family  of 
distributions  which  we  consider  in  this  paper. 

In  Section  2,  we  introduce  this  ^-Laplace  family  of  distributions.  These 
infinitely  divisible,  additive,  symmetric  distributions  have  extremely  thick 
tails  for  small  values  of  the  parameter  €   and  approach  a  normal  distribution 
as  €   increases.   A  square-root-Beta-Laplace  transform  is  introduced  which 
allows  us  to  transform  one  member  of  the  5-Laplace  family  into  another  in  a 
simple  manner. 

Section  3  introduces  5-Laplace  processes  with  first  order  autoregressive 
structure.   These  are  Markov  processes  with  the  geometric  autocorrelation 
function  which  is  typical  of  the  Gaussian,  first  order  autoregressive  (AR(1)) 
process.   The  basic  structure  is  a  random  coefficient  autoregression,  and  we 
generalize  the  structure  of  this  model  in  Section  4  to  encompass  moving 
average  (MA)  and  mixed  first-order  autoregressive,  qth-order  moving  average 
(ARMA)  processes.   Again,  both  the  structure  of  the  process  and  their 
autocorrelation  functions  mirror  those  of  the  Gaussian  ARMA  processes.   We 
also  briefly  compare  these  processes  with  others  which  may  be  obtained  by 
differencing  the  random  coefficient  Gamma  processes  of  Lewis,  McKenzie  and 
Hugus  (1989).   Finally,  we  show  that  a  most  important  application  of  ^-Laplace 
processes  lies  in  the  modelling  of  differenced,  possibly  non-stationary  Gamma 


processes. 

In  Sections  4  and  5,  we  consider  the  very  important  and  particular  case 
5=1,  i.e.  the  Laplace  distribution.   The  first  order  autoregression,  the 
l-BELAR(l)  process,  is  discussed  in  some  detail  and  parameter  estimates  and 
their  properties  are  discussed  and  assessed  both  theoretically  and  using 
simulation.  ' 

2.  THE  5-LAPLACE  FAMILY  OF  DISTRIBUTIONS 

2.1  £ -Lap lace  Random  Variables 

The  probability  density  function  of  a  Laplace  distributed  random 

variable,  L,  has  two  parameters a  location  parameter,  -«>  <  \x   <  +  °°,  and  a 

scale  parameter,  X  >  0  so  that 

fL(x,u.X)  =  ^  exp{-|x-*i|/X}  .  -»  <  x  <  +  oo.  (2.1) 

We  assume  without  loss  of  generality  that  ji  =  0  and  X  =  1  throughout  the 
remainder  of  this  discussion. 

The  characteristic  function  of  the  standard  Laplace  random  variable 
(fx  =  0  and  X  =  1)  is 

(J^(w)  =  (1  +  w2)"1  .  -«>  <  w  <  +  oo.  (2.2) 

It  is  well-known  that  the  standard  Laplace  distribution  belongs  to  the  class 
of  infinitely  divisible  distributions.  Thus 


(J^(w)  =  (1+w2)  e   ,  2  >  0,   -co<u<  +  ».  (2.3) 


is  the  characteristic  function  of  a  random  variable.   In  fact  it  is  the 
difference  of  two  independent,  identically  distributed,  i.i.d,  Gamma(l,l) 
random  variables,  where  2   is  the  shape  parameter  and  1  is  the  value  of  the 
scale  parameter,  k,  of  the  Gamma1  variable.   If  X  has  a  characteristic  function 
given  by  (2.3),  then  X  is  said  to  be  an  5-Laplace  random  variable. 

Since  (2.3)  is  real-valued  in  w,  X  is  a  symmetric  random  variable.   It  is 
also  easily  verified  that 


E(Xn)  = 


0  if  n  is  odd, 

(2.4) 
_(k+l)k(£)k        if  n  =  2k.  k=1.2 


where  (b)k  =  b(b+l) . . . (b+k-1)  for  b  >  0. 

Thus,  in  particular,  var(X)  =  22.      From  (2.4)  we  have 


E(X4)  -  rE(X)l4  _  3  +  3/e  (2  5) 

[Var(X)r 


and  the  kurtosis  approaches  3,  as  2-*°,    which  corresponds  to  the  kurtosis  of  a 
normal  distribution. 

We  note  too  that  we  are  dealing  with  an  additive  family  of  infinitely 
divisible  distributions,  in  the  sense  that  the  shape  parameter  is  additive. 
In  particular,  if  X  and  Y  are  independent  ^.-Laplace  and  £_-Laplace 
respectively,  then  X+Y  is  (2.+2~) -Lap lace.   This  is  easily  verified  using  the 
characteristic  function  (2.3).   Equally  important  is  the  obvious  fact  that  X-Y 
is  (£,+£«) -Lap lace.   Thus  differencing  keeps  one  in  the  same  family  of 

4 


distributions. 

2.2  The  £ -Lap lace  Density 

Since  X  is  the  difference  of  two  i.i.d.  Gamma(£,l)  random  variables,  we 
can  obtain  the  following  density  for  X; 


00  1-2 

fx(x.«)  =  S       -^—   [gjJ^J-]    exp{-(2g+x)}dg.         (2.6) 


where 


L(x)  =  Max(O.-x)  (2.7) 


Now  if  Ms  a  positive  integer,  (2.6)  and  (2.7)  can  be  evaluated 
analytically.   If  2  =   1,  we  obtain  the  density  of  the  standard  Laplace 
distribution.   For  2  =   2,  3  and  4  the  densities  are  also  well-known 
derivations  given,  for  example,  as  textbook  problems  in  Feller  (1971).   Figure 

2.1  displays  the  densities  for  2  =   1,2,3  and  4.   Note  how  the  graphs  approach 

2 

the  shape  of  a  normal  density  with  a  =  12   as  2   increases. 

When  2   is  not  an  integer,  (2.6)  must  be  evaluated  numerically.  Figure 
(2.2)  displays  examples  of  5-Laplace  densities  for  non-integral  2.     Note  that 
for  2   <  1,  the  density  is  not  absolutely  continuous  at  zero.   In  fact  for 
2   <  0.5,  the  density  is  infinite  at  zero.   For  2   >  .5  we  have 


9     9P-1 
fx(0,£)  =  r(25-l)/{rZ(£)2Z*   }  <  <».  (2.8) 


For  details  on  integrating  (2.6)  and  (2.7)  numerically,  see  Dewald  (1985) 


i-LAPLACE  DENSITIES  FOR  INTEGRAL  X 

J*1  (STANDARD  LAPUVCE)  X'2 


Figure  2.1   Probability  density  functions  of  the  5-Laplace  random  variables 
when  5=1,2.3  and  4. 
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Figure  2.2  Probability  density  functions  of  the  5-Laplace  random  variables 
when  5=0.25,  5=0.50,  5=0.75,  5=1.50. 


2.3  The  Square  Root  Beta-Laplace  Transformation. 

Much  of  the  sequel  uses  the  square-root-Beta-Laplace  transformation.  By 
this  technique,  an  £. -Laplace  random  variable  can  be  transformed  into  an 
^--Laplace  random  variable,  where  2~  £  2^. 

Theorem-   Let  X  "*  5-Laplace  and  B  ""  Beta  (2a, 2a)   where  2   >  0,   0  <  a  <  l.a  and 
a   =  1  -  a.   It  is  assumed  that  X  and  B  are  independent. 
If  Y  =  B1/2X,  then  Y  ~  (£a)-Laplace. 

Proof:    By  conditioning  on  B,  we  obtain  the  following  expression  for  the 
characteristic  function  of  Y: 


fy(w)  =  E{exp(-iB1/2Xw)} 

=  EB[E{exp(-ib1/2o.X)}] 
=EB[(l+bW2)"^]. 


(2.9) 


Using  the  binomial  expansion  in  (2.9)  we  have 


c 


•  (2Y 


^(W)  =  ^  I  ki^""2^ bk 


(2.10) 


Interchanging  the  expectation  and  summation  in  a  convergent  power  series  gives 


*>      («)i 


4U«)  =      -r-r-   (-a2)k  E(Bk) 
k=0  K* 


(2.11) 


From  Johnson  and  Kotz  (1970)  we  have,  for  integral  k. 


8 


E(Bk)  =  (€a)k/(€)k  (2.12) 


Substituting  back  into  (2.11)  completes  the  proof  thus 


w)  =  2  — _k(V)k  =  (I-h/)**.  (2.13) 

k=0   K- 


3.  THE  5-BETA-LAPLACE  FIRST-ORDER  AUTOREGRESSIVE  PROCESS,  5-BELAR(l) 

The  ^-Laplace  random  variable  is  not  only  infinitely  divisible  but  also 
self -decomposable  (Loeve,  1963).   Thus  a  linear,  constant  coefficient 
autoregression  can  be  defined,  as  was  done  in  Gaver  and  Lewis  (1980)  for  the 
first-order  autoregressive  process,  GAR(l),  in  Gamma-distributed  random 
variables.  Moreover  the  innovation  random  variables  are  differences  of 
independent  Gamma  AR1  innovation  random  variables  (Gaver  and  Lewis,  1980)  and 
can  therefore  be  generated  by  methods  of  Lawrance  (1982)  and  McKenzie  (1987). 

Unfortunately  this  linear  ^-Laplace  process  has  the  same  "zero-defect"  as 
the  Gamma  GAR(l)  process,  which  arises  from  the  fact  that  the  innovation 
random  variable  takes  on  the  value  0  with  positive  probability,  and  this 
probability  can  be  large  if  2.   is  small  and  the  lag-one  corrrelation,  p(l).  is 
large . 

For  this  reason  we  exploit,  in  this  section,  the  square-root  Beta-Laplace 
transform  to  define  a  different  2-parameter  first-order  autoregressive  process 
in  ^-Laplace  variables.   The  first  parameter,  €,    determines  the  non-Gaussian 
symmetric  marginal  distribution.   The  second  parameter,  a,  given  the  value  of 
i,   determines  uniquely  the  lag-one  serial  correlation.   Since  the  random 


coefficient  model  is  shown  to  be  first-order  Markovian.  a  determines  the 
entire  autocorrelation  function  up  to  the  sign.  The  process  does  not  have  the 
same  "zero-defect"  as  the  linear  model. 

We  define  the  stationary  process  {X  (5)}  by  means  of  an  additive,  random 
coefficient  equation-" 


1/2     —  1/2  — 

x  (5)  =  a  *(ea.ea)  x  ,(e)  +  b   (5a. 5a)  l  (5).       (3.1) 

nv  '         n   v     '  n-lv  '    n   v     '  nv  '  vy 


where  {A  (5a, 5a)}  is  an  i.i.d.  sequence  of  Beta(5a,5a)  r.V.'sj  {B  (5a, 6a)}    is 

an  i.i.d.  sequence  of  Beta(5a,5a)  random  variables,  independent  of 

{A  (5a, 5a)},  and  {L  (5)}  is  an  i.i.d.  sequence,  independent  of  both 

coefficient  sequences,  of  5-Laplace  r.v. '  s.   Coefficient  and  innovation 

sequences  are  all  assumed  to  be  independent  of  {X,  (5)}  for  all  k  £  n-1.   The 

process  {X  (5)}  so  defined  will  be  called  the  5-Beta-Laplace  AR(1)  process. 

5-BELAR(l). 

If  it  is  assumed  that  X  _1(5)  has  an  5-Laplace  distribution,  then  by  the 

theorem  in  Section  2.3  and  the  additivity  of  the  5-Laplace  family,  so  does 

X  (5).  The  fact  that  the  process  is  Markovian  follows  by  construction.   It  is 

also  explicitly  autoregressive.   To  start  the  process  in  the  stationary 

distribution,  set  Xft(5)  =  Lft(5).   Note  also  that  the  parameter  space  for  the 

process  is  the  set  {(5,a)|5>0. 

0  i   a  i   1}. 

For  the  Beta  random  variables  A  and  B  to  be  defined  properly,  each  of 

n      n 

their  parameters  must  be  positive.   However,  when  a  =  0  or  a  =  1,  (3.1)  is  no 
longer  appropriate.   Therefore  it  is  understood  that  if  a  =  0  then  {A  }  is 
identically  zero  and  {B  }  is  identically  one.   Thus  a  =  0  implies 
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X  (2)   =  L  {€)   and  {X  (£)}  is  an  i.i.d.  sequence  of  ^-Laplace  variables.   For 
convenience,  we  denote  the  innovation  process  in  (3.1)  by  {€  } ,  i.e. 


€n  =  Bn1/2(«a,&x)Ln(*).  (3.2) 


Examples  of  sample  path  behavior  'for  selected  €   and  a   are  given  in  Figure  3.1. 
Note  that  although  the  correlation  coefficient  is  approximately  0.8  for  all 
sets  of  2   and  a,  there  are  considerable  differences  in  the  sample  path 
behaviors  as  €   changes.   In  particular  when  €   is  small  there  are  runs  of 
values  that  are  very  nearly  zero  in  magnitude,  so  that  the  process  looks  very 
sporadic.   Wind  velocity  data,  especially  in  the  tropics,  tends  to  be  of  this 
form.   Using  (3.1)  we  obtain 


p(l)  =  Corr^CO.X^CO}  =  E{An1/2(€a,ea)}  (3.3) 


ar(la+l/2)r(l+l)  ,    . 

~  r(€+l/2)  r(€a+l)'  ^*4' 


Note  that  as  a-»  1,  then  p(l)-*  1.  Similarly  as  a-*  0,  p(l)-*  0.  We  can  show 
that  (3.4)  yields  the  full  range  of  positive  correlations  in  a  one-to-one 
function  of  a   for  any  given  value  of  €.      Further,  it  is  easily  established, 
using  (3.1),  that 

p(r)  =  Corr(Xn(*).Xn_r(*))  =  p(l)|r|.         r  =  0.±  l....(3.5) 

Note  also  that  except  at  €  =   0,  p(l)  -A  cc.      Since  we  shall  frequently 
refer  to  expressions  of  the  form  of  (3.4),  we  define  the  notation 
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i-BETA-LAPLACE  AR(1):     SAMPLE  PATHS 
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Figure  3.1  Sample  path  behaviour  for  ^-Beta-Laplace  processes  for  fixed  lag- 
one  serial  correlation  of  0.8,  and  different  values  of  B. 
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-r(ea)   =  E{[Beta(&x.£a)]1/2}.  (3.6) 


We  noted  above  that  the  process  is  explicitly  autoregressive.   It  is 
autoregressive  as  well  in  the  sense  of  expectations  (Lawrance  and  Lewis,  1987) 

in  that  E(X  (£) |X  _*[€)   =  x)  is  a  linear  function  of  x.   Further,  replacing 

1/2         1/2 
A    ( . )  by  -A    ( . )  in  (3.1)  yields  the  full  range  of  negative  correlations. 

Explicit  expressions  are  very  difficult  to  obtain  for  joint  distributions 

of  consecutive  observations  in  the  £-BELAR(l)  process  for  general  i.      Later, 

we  consider  in  more  detail  the  very  important  and  more  tractable  case  of  the 

Laplace  distribution,  i.e.  €   =  1.   Here,  however,  we  briefly  note  some 

aspects  of  one  of  the  important  properties  of  such  joint  distributions,  viz. 

time  reversibility.   A  process,  {X  },  is  time-reversible  if  the  joint 

distributions  of  (X  ,X  . 1 , . . . ,X  ,  )  and  (X  ,  ,X  ,   i , . . . ,X  )  are  identical  for 
v  n  n+1      n+r'     v  n+r  n+r-1      n' 

all  r  and  n.   This  is  an  important  property  in  practice  in  the  identification 
problem.   However,  it  is  a  property  of  the  joint  distributions  and  these  are 
not  known.   Nevertheless,  we  can  demonstrate  partial  time  reversibility  with 


respect  to  both  directional  moments  and  runs  probabilities. 

It  is  straightforward  to  verify  from  (3.1)  that  E(X2  X  .  )  =  E(X  X  fe)  = 

0  for  all  n  and  k.  Thus,  {X  (€)}   given  by  (3.1)  is  time-reversible  with 

respect  to  these  directional  moments.   Further,  the  characteristic  function  of 

X  -  X  ..is  given  by 
n    n-1    &      J 


E{exp[-i<o(Xn-Xn_1)]}  =  (1  +  W2)  ia   EA{[l+(l-A1/2)2<o2]"5a}     (3.7) 


and  this  is  clearly  real  valued.   Thus,  X  -  X  ,  is  symmetric  about  zero  and 
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so  P(Xr  >  Xnl)  =  P(Xn  <  Xnl).  i.e.  (Xn(«)}  is  time  reversible  with  respect 
to  runs  probabilities. 

4.  THE  €-BETA-LAPLACE  ARMA  PROCESSES 

4.1  Introduction 

In  this  section,  we  introduce  time  series  models  that  have  5-Laplace 
marginal  distributions  and  the  correlation  structure  of  the  linear,  Gaussian, 
ARMA  processes,  although  the  order  of  the  autoregression  is  limited  to  p=l. 
As  before,  our  constructions  are  based  on  the  square-root  Beta-Laplace 
transform  and  the  additivity  of  the  5-Laplace  family.   We  begin  by  considering 
pure  moving  average  processes. 

4.2  The  first  order  moving  average  process,  5-BELMA(l). 

Let  {L  (9)}  be  an  i.i.d.  sequence  of  0-Laplace  random  variables,  where 
9  =  t/(l+P)  and  0  <  J3  <  1.  Let  p  =  1-/3.  Also  let  {C  (6/3,6/3)}  be  an  i.i.d. 
sequence  of  Beta(6/3,6/3)  variates  independent  of  {L  (6)}.   Then  the  process 


n 


{X  (£)}  generated  by 


Xn(0  =  Ln(8)  +  C^Cep, 6/3^(8)  (4.1) 


has  a  marginal  5-Laplace  distribution  and  an  MA(1)  structure  with 

0  <  Corr(X  ,X  ,)  i   0.5,  as  shown  below. 
v  n  n-1' 

The  marginal  distribution  of  {X  (£)}  follows  directly  from  the  square 
root  Beta-Laplace  transform  and  the  additivity  of  the  fi-Laplace  family. 
Further,  the  construction  of  (4.1)  defines  {X  (£)}  as  a  random  coefficient 
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moving  average  of  order  one.  Thus,  Xn(£)  and  xn_k(^)  are,  by  construction, 
independent  for  |k|  >  2.   In  addition,  we  may  easily  verify  from  (4.1)  that 

px(l)  =  Corr(Xn,Xn_1)  =  nr(0/3)/(l+/3) ,  (4.2) 

where  i   is  defined  by  (3.6).  Writing  out  (4.2)  explicitly  yields 

_£_  r(pe+i/2)r(e+i) 
pxv  ;  "  1+/3  '  r(pe+i)r (e+1/2)-  ^"S) 

Thus,  as  |5-»  0  we  have  px(l)-*  0,  and  as  /3-»  1  we  have  px(l)-»  0.5.  We  may  show 
that  Py(l)  is  a  one-to-one  function  of  /3  extending  over  the  full  positive 

range  of  correlation  for  a  moving  average  process  of  order  one,  MA(1),  namely 

1/2      1/2 
[0,0.5].   In  addition,  replacing  C    by  -C  '  yields  the  full  negative  range 

[-0.5,0].   Thus,  the  entire  range  of  possible  values  of  Py(l)  for  an  MA(1)  is 

available  for  this  model.   This  is  important  in  practice,  relating  as  it  does 

to  the  applicability  of  the  model.   The  NLMA(l)  model  of  Dewald  (1985),  which 

is  a  MA(1)  model  with  Laplace  marginals  has,  in  contrast,  p(l)  bounded  by  ± 

0.4026. 

It  is  straightforward  to  show  that  this  model,  like  the  autoregressive 

model  £-BELAR(l),  is  time  reversible  with  respect  to  runs  probabilities. 

4.3  Higher  order  moving  average  processes,  the  £-BELMA(q)  process. 
The  process  defined  by  (4.1)  may  be  extended  to  order  q  as  follows.   Let 
{L  (0)}  be,  as  before,  an  i.i.d.  sequence  of  0-Laplace  random  variables,  where 

q 

now  0  =  £/(l+2  p\)  and  0  <  p\  <  1 .  for  i  =  1 .2 q.   Also  let 

i=l  X  l 

{C   .  (0/3.  ,0/3. ) ,  i  =  1,2 q}  be  independent  sequences  of  i.i.d. 
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Beta(0/3.  ,9p\  )  random  variables  independent  of  {L  (8)}.  Then,  the  process 


{X  (£)}  generated  by 


Xn(<?)  =  Ln(0)  +  I  C*^  (9^.9^)1^(9)  (4.4) 

i 
has  a  marginal  ^-Laplace  distribution  and  MA(q)  structure.  These  properties 

follow  in  the  usual  way.   As  regards  the  autocorrelation  function  {pY(k)}.  it 

is  clear  that  pY(k)  =  0  for  |k|>q,  and  we  can  show  that 


q-k                   q 
px(k)  =  nr(epk)  +  2  T(ep1Mep1+k)}/<i+  2  P\}.    k-1.2 q.(4.5) 


It  is  interesting  to  note  that  although  (pY(k)}  is  similar  in  form  to  the 
autocorrelation  function  of  a  Gaussian  linear  MA(q)  process,  the  correlations 
are  not  identical. 

This  q-th  order  moving  average  process,  too.  is  time-reversible  with 
respect  to  runs  probabilities. 

4.4.   The  first  order  mixed  autoregressive  moving  average  process, 
«-HELAHMA(l.l). 

The  first  order  mixed  model  is  obtained  by  defining  the  process 
{X  (€)}   by  means  of  two  first  order  difference  equations,  one  corresponding  to 
the  autoregressive  (AR)  component  and  the  other  to  the  moving  average  (MA) 
component.   These  are  respectively: 


Yn(G)  =  ^  (Ga,Ga)  Vl(9)  +  Bn/2(0a.0°O  Ln(0)  (4-6> 
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and 


Xn(*)  =  Ln(B)  +  C^/2  (0/3. 9/3)  Y^jCB).  (4.7) 


where  9  =  £/(l+/3);  0  <  a,   j3  <  1;  and  {Ln(9)},  {An(9a.0a)}f  (Bn(9a,9a)}  and 
{C  (9/3,9)3)}  are  independent  sequences  of  i.i.d.  random  variates  with  the  usual 
distributions  signified  by  this  notation.   As  before,  we  may  show  that  {X  (£)} 
is  marginally  ^-Laplace  and  has  the  autocorrelation  function  of  an  ARMA(l.l) 
process.   In  particular, 


Px(k)  .  ,(ep)H9a)+MepMea)U,(ea)]k-1,  k  =  lt2 (4.8) 


Note  that  this  process  {X  (2)}  has  a  structure  which  is  determined  in  a  simple 
way  by  the  two  parameters  a  and  /3.  If  /3  =  0,  we  have  the  £-BELAR  (1)  process, 
and  if  a  -  0  the  process  reduces  to  the  £-BELMA(l)  process.  Further,  if  a  =  /3 
=  0  then  {X  (€)}   is  a  sequence  of  i.i.d.  ^-Laplace  variates. 


4.5  Other  ^-Laplace  ARMA  processes 

We  can  extend  the  models  described  in  the  previous  sections  to  an  ARMA 

(l.q)  form.   This  is  achieved  by  replacing  (4.7)  by  an  expression  similar  to 

(4.4)  but  involving  L  ,  L  , L    .,  and  Y   .  However,  nothing  new 

v    '  &  n   n-1       n-q+1      n-q  & 

emerges  here  and  so  we  consider  instead  some  other  5-Laplace  processes 
generated  as  the  differences  between  Gamma  processes. 

(i)  Independent  Gamma  processes. 
Suppose  that  {X  }  and  {Y  }  are  independent  Gamma  processes  with 
Gamma(£,l)  marginal  distributions  and  identical  ARMA  structures.   Consider  the 
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process  {Z  }  given  by  Z  =  X  -  Y  .  As  noted  in  Section  2,  the  marginal 

distribution  of  {Z  }  is  5-Laplace  and  the  process  shares  the  same 

autocorrelation  function  as  {X  }  and  (Y  }.  On  the  other  hand,  {Z  }  is  not 

n       n  n 

structurally  the  same  as  these  two  processes.   Thus,  this  approach  allows  us 
to  construct  ^-Laplace  processes  with  ARMA  autocorrelation  functions  but  not 
the  simple  linear  additive,  randbm  coefficient  ARMA  structure  of  the  5-BELARMA 
processes. 

(ii)  Serial  differences  in  the  BGARMA  processes. 
A  feature  of  the  BGARMA  processes  of  Lewis,  McKenzie  and  Hugus  (1989) 
which  is  relevant  here  is  that  the  first  order  AR  and  MA  processes,  BGAR(l) 
and  BGMA(l).  have  the  same  form  of  bivariate  distribution  for  two  consecutive 
observations.  If  the  BGAR(l)  or  BGMA(l)  process  is  {X  }  and  is  marginally 
Gamma(£,l),  the  joint  characteristic  function  of  X  and  X  _1  is 


*X  X   (U,U)  =  (1+lu>  €f>   (1+iu)  €P   (1+iu+iu)  ep, 
n'  n-1 


where  p  =  corr  (X  ,X  _1 )  in  both  processes,  and  p  =  1-p.   Now,  the  process  of 

first  differences,  AX  =  X  -X  ,,  has  its  marginal  distribution  given  by  the 

n    n  n-1  &  ° 

characteristic  function 


*AX(U)  =  *X  X   (U'"U)  =  (1+u2)  iP- 
n'  n-1 


Thus.  {AX  }  is  marginally  (£p)-Laplace.   Furthermore,  if  {X  }  ~  BGMA(l) 
originally,  then  {AX  }  is  an  (£p)-Laplace  with  MA(2)  correlation  structure. 
In  fact.  PAX(1)  =  -(l-2p)/2(l-p)  and  PAY(2)  =  -p/2(l-p).   If  {X^}  "  BGAR(l) 
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originally,  then  {AX  }  is  an  (^p)-Laplace  process  with  ARMA(l.l)  correlation 

1      k-1 
structure.   In  this  case,  Piv(k)  =  ~  o"  (1-P)P   .  k  £  1.   Again,  these  first 

differences  processes  do  not  have  the  simple  ARMA  structure  of  the  processes 

presented  earlier,  although  they  may  share  their  marginal  distributions  and 

autocorrelation  functions. 


4.6.  The  £ -Lap lace  ARIMA  processes 

In  the  practical  application  of  the  Gaussian  ARMA  processes  it  has  been 
found  extremely  useful  to  extend  the  basic  models  to  processes  {X  }  which  are 
non- stationary  but  whose  first  (or  higher  order)  differences  are  an  ARMA 
process.   Such  processes  are  called  integrated  ARMA  processes  and  we  refer  to 
such  an  {X  }  as  an  ARIMA  process.   This  is  a  trivial  extension  mathematically, 
but  it  would  be  difficult  to  overestimate  the  extra  range  of  applicability  it 
bestows  in  practice  to  time  series  models.   The  approach  is  discussed  in 
detail  by  Box  and  Jenkins  (1976)  and  has  been  found  enormously  useful. 

Defining  integrated  processes  presents  no  problems  for  Gaussian  processes 
which  retain  their  marginal  distributions  under  simple  linear  transformations, 
even  of  dependent  data.   However,  if  a  non- stationary  time  series  is  thought 
to  be  marginally  Gamma  it  is  by  no  means  clear  just  how  its  differences  might 
be  distributed.   We  have  detailed  a  number  of  occasions  here  in  which 
differenced  Gammas  have  the  ^-Laplace  distribution  and  we  would  argue  that  it 
is  not  unreasonable  to  assume  that  they  will  often  be  so  in  practice,  at  least 
approximately.   Thus,  in  attempting  to  model  a  non- stationary  Gamma  process 
whose  differences  seem  stationary,  we  would  consider  modelling  the  differenced 
process  as  an  ^-Laplace  ARMA  process.   We  would  thus  argue  that  such  5-Laplace 
ARIMA  processes  are  potentially  of  great  practical  importance. 
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5.  THE  FIRST-ORDER  BETA-LAPLACE  AUTOREGRESSIVE  PROCESSES.  l-BELAR(l) 

5.1  Construction  and  Correlation  Structure 

In  this  section  we  will  study  in  detail  the  first-order  5-Beta  Laplace 
process  with  €   =  1  (i.e..  l-BELAR(l)).  This  process  is  far  more  tractable 
than  the  general  case,  and  is  an  interesting  alternative  to  the  autoregressive 
NLAR(l)  process  introduced  by  Dewald  and  Lewis  (1985).  Thus  we  set  i   =  1  in 
(3.1)  to  obtain 


1/2 
X  =  A   (a.l-a)X  ,  +  e  .  (5.1) 

n    n  v     '  n-1    n  v   ' 


where  (e  }  is  i.i.d.  sequence  with  e   ~  (l-a)-Laplace.   Also  {X  }  has  a 
marginal  distribution  that  is  standard  Laplace. 

The  only  parameter  in  the  model  is  a.  which  describes  the  dependency 
structure  of  the  process,  as  given  by  (3.4)  and  (3.5).  when  8   =  1,  as 

_  rar(a+l/2)  "]|r| 
P(r)  -    |_r(3/2)r(a+l)J    '  ^ '*> 

for  r  =  0.  ±  1.  ±  2 • 

Now  there  are  infinitely  many  other  first-order  AR(1)  processes  with 
identical  autocorrelation  functions  and  standard  Laplace  marginals,  in 
particular  the  NLAR(l)  processes  by  Dewald  and  Lewis  (1985).   Dewald  (1985) 
developed  fourth-order  analyses  of  the  NLAR(l)  model  based  on  the  linearized 
residual , 
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Rn  =  Xn  -  p^DX^  (5.3) 

to  differentiate  between  different  parametric  cases  of  the  model.  The  use  of 
higher-order  residual  analysis  in  random  coefficient  processes  was  established 
by  Lawrance  and  Lewis  (1987).   We  use  (5.3)  again  in  Section  6. 

i 
5.2.  The  Conditional  Density  of  X  ,  given  X  _1  =  y. 

To  find  the  conditional  density  of  X  |X  ,  =  y,  we  begin  by  deriving  the 

1/2  2 

density  of  A   (a,l-a)  by  differentiating  P(A  <  a  )  with  respect  to  a: 


2a-l 

2        2L^ 

f  1/0(a;a)  =  r,*rn      » 5—7  ,    0  <  a  <  1.  (5.4) 

Al/2       r(a)r(l-a)  ^1-a2.a 


1  /o 
To  evaluate  P{X  <  x  I  X  .,  =  y}  we  condition  on  A   (a,l-a)    thus 
1  n     ■  n-1  Ji  n  v     ' 


P{X  <  x  I  X  ,  =  y  }  =  P{A1/2  X  ,  +  e  <x|X  ,=y} 
v  n     '  n-1  J    '  x   n    n-1    n     '  n-1  J1 

=  EAl/2  <Fe  (x_ay)} 
A      n 
n 

L2(x) 

=  J*      F£  (x-ay)  f  1/2(a;a)da,         (5.5) 

Li (x)    n       A 

lv  '  n 


where  F  (x-ay)  is  the  cummulative  distribution  function  of  the  (l-a)-Laplace 

random  variable,  e  ,  and  L.(x).  for  i  =  1,2,  are  the  limits  of  integration  on 

a,  which  may  be  functions  of  x. 

Since  F  (x-ay)  changes  definition  for  negative  and  positive  values  of 
n 

(x-ay)  and  since  0  <  a  <  1,  differentiating  (5.5)  with  respect  to  x  gives  two 
branches  for  the  conditional  density  as 
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a=l 

J  f   (x-ay;l-a)f   .  9(a;a)da  x/y  I   1  or 

a=0   n          A  '*  x/y  <  0 

n  J    * 


a=x/y 

FX  X    *X'y)  =      *         fe  (x-ay:1_Gt)  f  1/2(a;a)da  +        (5.6) 
nl  n-1  a=0,    A  A 

1  n 


a=l 

J*     f£  (x-ay;l-a)f   1/g  (a;a)da.  0<x/y<l 


a=x/y    n  A 

n 


Now  f   (.;l-a)  can  be  evaluated  from  (2.6)  and  f  ./t>   ( .  ;  1-a)  from  (5.4).   The 

n  A 

n 

equation  (5.6)  is  computed  numerically,  as  given  in  Dewald  (1985). 

Note  that  (5.6),  as  a  function  of  x,  is  not  absolutely  continuous.   In 
fact  if  a  <,   1/2  and  x  =  y,  the  conditional  density  is  not  defined. 

We  use  (5.6)  in  obtaining  the  maximum  likelihood  estimates  of  serial 
correlations  through  the  parameter  a.   This  is  discussed  in  Section  6. 

6.  ESTIMATION  OF  SERIAL  CORRELATION  IN  l-BELAR(l) 

6.1  Least-Squares  Estimation  of  p(l) 

Although  the  l-BELAR(l)  process  is  not  a  linear  process  (except  when 

a   =  0)  the  linearized  residual  is  formed  as  in  (5.3).   The  process  {R  },  as 

shown  by  Lawrance  and  Lewis  (1987).  is  uncorrelated  but  not  independent.   Also 

it  can  be  seen  that  conditional  on  X  ,,  that  R  has  zero  mean  and  variance 

n-1        n 

2 
2{l-(p(l))  }.   Therefore  a  conditional  least-squares  estimator  (Nicholls  and 

Quinn,  1982)  of  p(l),  p.  ^  can  be  derived  by  minimizing  the  sum  of  squares  of 
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R  with  respect  to  p(l).   This  produces  the  usual  product  moment  estimator 
n 


n         n   2 
Pis  =  2  X<x*-1  7  2  X<-1-  (6.1) 


^  "ii  *  i-1   i:2   * 


Now  Dewald  (1985)  showed  that  the  l-BELAR(l)  process  is  a  Random 
Coefficient  Autoregressive  (RCA(l))  process  in  the  sense  of  Nicholls  and  Quinn 
(1982).  Therefore  from  Theorem  3.1  of  Nicholls  and  Quinn,  PI<5-»p(l),  and  p.  „ 
is  asymptotically  unbiased  and  normally  distributed,  with  asymptotic  variance 
given  by 


nVar(pLS)  =  1  +  5a  -  6[p(l)]2.  (6.2) 


Simulations  of  p.  ~  were  conducted  for  selected  values  of  p(l)  in  the 
l-BELAR(l)  process  using  the  Simulation  Testbed  (SIMTBED)  of  Lewis  et  al. 
(1984).   Detailed  tabulations  of  the  distribution  of  p,  ~  are  given  in  Dewald 
(1985)  for  various  values  of  a  and  different  sample  sizes. 

Dewald  (1985)  also  showed  via  simulation  that  certain  robust  estimators 
based  on  symmetric  loss  functions  of  R  ,  other  than  the  sum  of  squares,  are 
biased  and  apparently  asymptotically  biased.   These  robust  estimators  include 
the  Huber(c),  rank,  and  least  absolute  deviation  (LAD)  estimators  of  p(l). 
Note  that  these  results  for  robust  estimators  are  different  in  the  l-BELAR(l) 
process  than  in  the  linear  AR(1)  processes  for  which  these  estimators  are 
reported  to  be  consistent  and  asymptotically  unbiased,  e.g.  Denby  and  Martin 
(1979)  and  Bloomfield  and  Steiger  (1983). 

The  least-squares  estimator,  P,q.  is  used  in  the  next  subsection  as  a 
starting  value  in  an  iterative  procedure  to  find  the  maximum  likelihood 
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estimator,  Putj?.  of  p(l) 


6.2  Maximum  Likelihood  Estimation 

The  maximum  likelihood  estimate  of  p(l)  can  be  obtained  from  the 

conditional  density  in  (5.6).  The  formula  for  the  joint  density  of  X  , 

X  , X,  is 

n-i      l 

f(x.x    ..x)  =  fx  (x   (xn|xn_1)fx   ,    (xn-l'xn-2>  •fX<xl>^6^ 

n'  n-1  n-1 •  n-2  1 

where  fY  (xt )  is  a  standard  Laplace  density. 
*1  l 

Dewald  (1985)  used  (6.3)  to  obtain  estimates  for  a   and  then  computed  pw _ 
from  the  one-to-one  function  of  a   in  (5.2).   The  least-squares  estimate  served 
as  a  good  starting  point  for  the  numerical  evaluations  to  find  p^-. 
Techniques  for  the  numerical  integration  and  search  for  this  estimation 
procedure  are  given  in  Dewald  (1985). 

The  simulation  results  in  Dewald  (1985)  indicate  that  Pwr  F  has  a  smaller 
standard  deviation  and  bias  than  does  p.  _  at  all  values  of  p(l).   Some 
particular  results  are: 

(1)  For  a  sample  size  of  n  =  175,  and  a  =  0.5,  so  that  p(l)  =  0.6366, 
the  correlation  between  the  two  estimates  is  estimated  to  be  0.393. 
The  estimated  standard  deviation  of  p.  q  is  0.0728  and  that  of  p^p 
is  0.0289. 

(2)  For  a  sample  size  of  n  =  125,  and  a  =  .11,  so  that 

p(l)  =  .1922,  the  correlation  between  the  two  estimates  is  estimated 
to  be  0.843.  The  estimated  standard  deviation  of  p.  ^  is  0.099  and 
that  of  pmF   is  0.069. 

(3)  For  a  sample  size  of  n  =  250,  and  a  =  0.844,  so  that 
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p(l)  =  -0.900,  the  correlation  between  the  two  estimates  is 
estimated  to  be  0.101.   The  estimated  standard  deviation  of  pT c  is 
0.030  and  that  of  pmF   is  0.006. 
The  number  of  replications  in  this  simulation  was  only  20,  so  the  results 
are  imprecise.   However,  it  is  clear  that  the  maximum  likelihood  estimator  of 
p(l)  is  better  than  the  least  squares  estimator,  but  the  computation  of  the 
maximum  likelihood  estimator  is  not  simple. 

Dewald  (1985)  also  gives  results  on  joint  estimates  of  p(l),  E(X  )  and 
Var(X  )  in  the  l-BELAR(l)  process. 

Also  one  should  note  that  while  the  least-squares  estimator  of  a  and  p(l) 
is  available  for  the  general  ^-BELAR(l)  process,  maximum  likelihood  estimation 
of  a   and  p(l)  is  intractable  unless  2  =   1. 
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